Hi,

we are in a situation where we have a few publications and each mobilink publication works as expected.

Though, when initially synchronizing one of the publication it takes about half an hour before the download of data starts. The second time around it just goes really fast and other publications that download even more data work quite fast as well.

We have seen something like this before when database queries are really slow on the consolidated database. Unfortunatly our DBA cannot see any problems here this time.

We are going to have someone over from Oracle to check the database for problems...

Is there anything else that we should take in account when investigating something like this?

Things we have checked: we've done some network tests to see if things are slow here... but it appears not the consolidated database (oracle) has been monitored for slow queries and locks. non have been found. Load on the database server is very low.

thank you,

asked 03 Dec '14, 09:02

vdcey's gravatar image

vdcey
639212736
accept rate: 33%

1

Just to make that clear: you have a long running download only when a new device (a new remote database) syncs for the first time? After that all publications from the same remote database run okay? Or does this only happen after a restart of the consolidated database? Can you reproduce the behaviour and if so, how?

Kind regards, Chris

(03 Dec '14, 09:43) Chris Werner

That is correct. Only the first time. The strange thing is... I have 3 publications... and the heaviest load is downloaded pretty fast.

(03 Dec '14, 14:09) vdcey

Oracle consultant has done some checks and could confirm what I was seeing in mobilink log. 3 slow download_cursors => 3 slow queries.

The reason why the query was going fast when I was running it manually is because it used a different executionplan because the query in the download_cursor was just a tiny bit differently written (extra spaces or something).

our DBA had noticed a lot of IO but was not sure this was a problem. The oracle consultant confirmed that this really is a problem and the amount of IO was caused of not having a big enough memory Buffer for oracle. Yesterday we added more RAM memory and after half a day of synchronizations everything has come up to speed.

Everyone thanks for the feedback. Upvotes for all !

permanent link

answered 10 Dec '14, 02:19

vdcey's gravatar image

vdcey
639212736
accept rate: 33%

There's a couple of things you need to do to find out where the issue is:

  1. Connect the Mobilink Monitor to your Mobilink instance so that you can see whether the delay is in the fetch of the download data, sending it to the remote, or elsewhere.

  2. Check that your timestamp columns (if you're using timestamps) are indexed properly.

Note that an initial synchronization will download all data to the remote, and the remote will then have to insert all the changes into it's DB, indexing as it goes. I have a publication that takes over 10 minutes for an initial sync to be applied after download, so it's not unusual.

permanent link

answered 03 Dec '14, 12:54

Mike%20Killey's gravatar image

Mike Killey
291169
accept rate: 60%

Thanks for the tip. I guess I'll have to see if I can get more detailed info..

(03 Dec '14, 14:13) vdcey
1

Note that in version 16, the tool that was previously called the MobiLink Monitor was rebranded as the MobiLink Profiler.

(03 Dec '14, 16:33) Bill Somers
Replies hidden
1

FYI, the ML Profiler is more than just a rename. For example, it allows finer grained timing (to the level of sync events or scripts) and sample-based profiling. It also detects blocking in the consolidated database.

(04 Dec '14, 09:57) Graham Hurst

Hi, I'm using version 12. Mobilink monitor reports a very long fetch_download .. I'm assuming this means mobilink is actually getting the data during this period from the consolidated database?

(04 Dec '14, 12:42) vdcey
Replies hidden
1

This is probably caused by either having -cmax (or its alias -cm) set too low, or having a long running download script. Look for warning 10082 in your log file. If it shows up, then bump up -cmax. If it doesn't show up, then you probably have a slow query. There isn't a good, automatic way to find which query is the problem in 12. You can try whatever diagnostic tools Oracle provides, or you can try to infer timings through the MobiLink server log. You'll want to turn on either -vc or -vt to make the server print the SQL that it's executing.

(04 Dec '14, 14:55) Bill Somers

hi,

-cm or cmax is not being used so I assume it's defaulting to 70%, server memory usage is closer to 30%. I've turned on -vt and gonna check if I can find anything, allthough it's difficult since it's a production server and all other user-logs are in between.

(08 Dec '14, 02:31) vdcey

I have a few dozen download-cursors and between 3 of those download-cursors I see a 5 to 7 minute time-laps in the mobilink logs.

Am I correct to assume that the queries in those download cursors are actually taking 5 to 7 minutes to extract the data?

Thank you !

(08 Dec '14, 02:48) vdcey
Replies hidden
1

As stated above, you can try adding -vt to your Mobilink server instance to log the actual SQL that is being used. Do that on a known client and when you've observed the issue, copy all of the SQL out of the log file. Run each query against your consolidated database, and you should get an idea of which script is taking so long to execute.

(08 Dec '14, 04:26) Mike Killey

Ow, I thought I had to add -vt on the configuration of the mobilink service... thanks for the tip.

Edit: looks like I can't use -vt on the client. tried to use -vc but not seeing any queries on the client side (or I don't know where to look)... on the server logs I do see extra information but only what download scripts are being called at what time. I guess I'll just have to investigate those further.

(08 Dec '14, 04:36) vdcey

some extra feedback: After adding -vt to the mobilink configuration on the server. I noticed 3 slow download-cursors. I come to this conclusion because it was taking several minutes before moving to the next download_cursor in the log file. So I looked in to the sql that is behind it.... and executed it directly on the consolidated database and it did not take more then half a second to get the results => 410 row

(08 Dec '14, 08:25) vdcey
showing 5 of 10 show all flat view

If the SQL in the three slow download_cursors runs quickly when executed directly then it suggests a concurrency problem with the scripts (since the MobiLink server has many connections that are simultaneously active). For example, if in the MobiLink Monitor you see long phases that end around the same time (though starting at different times) it indicates that the first one is blocking the others.

FYI with v16, the ML server detects such blocking and the ML Profiler highlights it.

We've also seen syncs against Oracle slow to a crawl due to Oracle "background" processes that become bottlenecks under high load. For example, a customer found that under high sync loads Oracle DB archive logs would fill up, causing a checkpoint and swapping of archive logs, during which synchronizations could not proceed, and there appeared to be no load on the Oracle server. Other culprits might include Oracle streams, database flashback, or other administrative processes.

We've also found deadlock warnings in the MobiLink server log when Oracle DBAs had said there was no deadlock.

permanent link

answered 08 Dec '14, 15:18

Graham%20Hurst's gravatar image

Graham Hurst
2.6k11843
accept rate: 28%

edited 08 Dec '14, 18:33

Your answer
toggle preview

Follow this question

By Email:

Once you sign in you will be able to subscribe for any updates here

By RSS:

Answers

Answers and Comments

Markdown Basics

  • *italic* or _italic_
  • **bold** or __bold__
  • link:[text](http://url.com/ "title")
  • image?![alt text](/path/img.jpg "title")
  • numbered list: 1. Foo 2. Bar
  • to add a line break simply add two spaces to where you would like the new line to be.
  • basic HTML tags are also supported

Question tags:

×278
×27

question asked: 03 Dec '14, 09:02

question was seen: 952 times

last updated: 10 Dec '14, 02:19