Model ServingOnline Inference Service
View Online Reasoning Service
Guide for managing online inference services, including how to view basic information, logs, and service details.
View Online Reasoning Service
Prerequisites
- The management console account and password have been obtained.
- An online inference service has been created.
Procedure
- Log in to the management console.
- In the top navigation bar, click Products and Services > AI Computing Platform > AI Computing Platform to go to its overview page.
- In the left navigation bar, select Inference Service > Online Inference Service to enter the Online Inference Service List page.
- On the inference service list page, you can view the basic information of all online inference services in the current platform.
Page Information | Illustrate |
---|---|
Service Name/ID | Service name: User-defined when creating an online inference service. Service ID: Automatically generated by the system. Click the service ID to directly enter the details page of the inference service. |
State | The current status of the inference service, including waiting, creating, running, closed, failed, etc. |
Resource Configuration | When creating an inference service, users select resource specifications. |
Model | The name of the model deployed when creating an inference service. |
Examples | The total number and normal number of Pod instances. The total number refers to the number of instances set by the user when creating an inference service and selecting resource configuration. |
Access Address | The access address of the successfully deployed model, which supports intranet access or extranet access. |
Creation Time | The time when the current inference service was created. |
Update Time | The time when the current inference service was updated. |
Operate | The operations supported by inference services in different states are different, mainly including service details, closing, opening, and deleting. |
- Click the service name/ID of an inference service (or click Service Details in the Operation column) to enter its details page.
- In the Service Information tab, you can view the basic information, instance information, and billing information of the current inference service.
- Select the Service Log tab to view the log information of all instances of the current inference service. You can also view specified log content based on the start and end time or by entering keywords in the search box.